Adaptive Critic Designs - Neural Networks, IEEE Transactions on
نویسنده
چکیده
We discuss a variety of adaptive critic designs (ACD’s) for neurocontrol. These are suitable for learning in noisy, nonlinear, and nonstationary environments. They have common roots as generalizations of dynamic programming for neural reinforcement learning approaches. Our discussion of these origins leads to an explanation of three design families: Heuristic dynamic programming (HDP), dual heuristic programming (DHP), and globalized dual heuristic programming (GDHP). The main emphasis is on DHP and GDHP as advanced ACD’s. We suggest two new modifications of the original GDHP design that are currently the only working implementations of GDHP. They promise to be useful for many engineering applications in the areas of optimization and optimal control. Based on one of these modifications, we present a unified approach to all ACD’s. This leads to a generalized training procedure for ACD’s.
منابع مشابه
Adaptive Critic Learning Techniques for Engine Torque and Air-Fuel Ratio Control
A new approach for engine calibration and control is proposed. In this paper, we present our research results on the implementation of adaptive critic designs for self-learning control of automotive engines. A class of adaptive critic designs that can be classified as (model-free) action-dependent heuristic dynamic programming is used in this research project. The goals of the present learning ...
متن کاملAn ART-based fuzzy adaptive learning control network
This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON), constructed by integrating two fuzzy adaptive learning control networks (FALCON), each of which has a feedforward multilayer network and is developed for the realization of a fuzzy controller. One FALCON performs as a critic network (fuzzy predictor), the other as an action network (fuzzy controller). Using t...
متن کاملNeurocontroller alternatives for "fuzzy" ball-and-beam systems with nonuniform nonlinear friction
The ball-and-beam problem is a benchmark for testing control algorithms. In the World Congress on Neural Networks, 1994, Prof. L. Zadeh proposed a twist to the problem, which, he suggested, would require a fuzzy logic controller. This experiment uses a beam, partially covered with a sticky substance, increasing the difficulty of predicting the ball's motion. We complicated this problem even mor...
متن کاملA deployed engineering design retrieval system using neural networks
We describe a neural information retrieval system (NIRS), now in production within the Boeing Company, which has been developed for the identification and retrieval of engineering designs. Two-dimensional and three-dimensional representations of engineering designs are input to adaptive resonance theory (ART-1) neural networks to produce clusters of similar parts. The trained networks are then ...
متن کاملIntelligent supply chain management using adaptive critic learning
A set of neural networks is employed to develop control policies that are better than fixed, theoretically optimal policies, when applied to a combined physical inventory and distribution system in a nonstationary demand environment. Specifically, we show that model-based adaptive critic approximate dynamic programming techniques can be used with systems characterized by discrete valued states ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998